Adaptive parallelism in compiler-parallelized code

Authors

  • Mary W. Hall
  • Margaret Martonosi
Abstract

As moderate-scale multiprocessors become widely used, we foresee an increased demand for effective compiler parallelization and efficient management of parallelism. While parallelizing compilers are achieving success at identifying parallelism, they are less adept at predetermining the degree of parallelism in different program phases. Thus, a compiler-parallelized application may execute on more processors than it can effectively use, a waste of computational resources that becomes more acute as the number of processors increases, particularly for systems used as multiprogrammed compute servers. This paper examines the dynamic parallelism behavior of multiprogrammed workloads using programs from the SPECfp95 and NAS benchmark suites, automatically parallelized by the Stanford SUIF compiler. Our results demonstrate that even the programs with good overall speedups display wide variability in the number of processors each phase (or loop) can exploit. We propose and evaluate a run-time system mechanism that dynamically adjusts the number of processors used by a compiler-parallelized application, responding to observed performance during the program's execution. Executing programs can thus adapt processor usage both to poor parallelism within certain parts of their code and to heavy multiprogramming loads during execution. This mechanism improves workload performance by up to 33% over consecutive standalone runs of each program.
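The abstract does not spell out how the run-time mechanism decides on a processor count, so the following is only a minimal sketch of the general idea of adjusting processors per loop from observed performance. Everything in it is an assumption made for illustration: the `LoopController` class, the 0.5 efficiency threshold, the halving policy, and the Amdahl-style timing model that stands in for real measurements. It is not the authors' mechanism, and it does not model multiprogramming load.

```python
from typing import Optional


class LoopController:
    """Hypothetical per-loop controller (illustrative; not the paper's design)."""

    def __init__(self, max_procs: int, min_efficiency: float = 0.5) -> None:
        self.procs = max_procs                # start with every processor
        self.min_efficiency = min_efficiency  # assumed threshold
        self.serial_time: Optional[float] = None

    def record_serial(self, seconds: float) -> None:
        """Store a one-processor baseline time for this loop."""
        self.serial_time = seconds

    def update(self, parallel_seconds: float) -> int:
        """Halve the processor count when observed efficiency is too low."""
        if self.serial_time is None:
            raise ValueError("record_serial must be called first")
        speedup = self.serial_time / parallel_seconds
        efficiency = speedup / self.procs
        if efficiency < self.min_efficiency and self.procs > 1:
            self.procs = max(1, self.procs // 2)
        return self.procs


def amdahl_time(serial_seconds: float, parallel_fraction: float, procs: int) -> float:
    """Stand-in for a real timing measurement, using Amdahl's law."""
    return serial_seconds * ((1.0 - parallel_fraction) + parallel_fraction / procs)


def settle(parallel_fraction: float, max_procs: int, rounds: int = 10) -> int:
    """Run the controller over repeated executions of one simulated loop."""
    serial_seconds = 8.0
    controller = LoopController(max_procs)
    controller.record_serial(serial_seconds)
    for _ in range(rounds):
        observed = amdahl_time(serial_seconds, parallel_fraction, controller.procs)
        controller.update(observed)
    return controller.procs


if __name__ == "__main__":
    print(settle(0.5, 8))    # poorly parallel loop: the controller backs off
    print(settle(0.99, 8))   # highly parallel loop: the controller keeps all processors
```

In this toy model a loop that is only half parallel gives up processors until its efficiency clears the threshold, while a loop that is almost fully parallel keeps all of them, which mirrors the per-loop variability the abstract reports.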


Similar resources

Automatically Tuning Parallel and Parallelized Programs

In today’s multicore era, parallelization of serial code is essential in order to exploit the architectures’ performance potential. Parallelization, especially of legacy code, however, proves to be a challenge as manual efforts must either be directed towards algorithmic modifications or towards analysis of computationally intensive sections of code for the best possible parallel performance, b...


Exploring the Use of Hyper-Threading Technology for Multimedia Applications with Intel® OpenMP* Compiler

Processors with Hyper-Threading technology can improve the performance of applications by permitting a single processor to process data as if it were two processors by executing instructions from different threads in parallel rather than serially. However, the potential performance improvement can be only obtained if an application is multithreaded by parallelization techniques. This paper pres...


AESOP: The Autoparallelizing Compiler for Shared Memory Computers

An automatic parallelizer is a tool that converts serial code in C, C++ and Fortran to parallel code. This is an important problem because most hardware today is parallel and manually rewriting the vast repository of serial code is tedious and error prone. We have developed an automatic parallelizing compiler for source code, AESOP targeting shared memory machines. AESOP leverages the LLVM infr...


MaxPar Extensions for Isolating Performance Problems

Many applications are too large for easy hand optimization. The Parallel Performance Analyst is a tool for automatically characterizing and identifying the regions of the application most deserving of programmer attention. Among its components are interactive and traced-based profiling tools, and the MaxPar execution-driven simulator, which computes the amount of parallelism inherent in a code,...


Automatic Resource-Constrained Static Task Parallelization

This thesis intends to show how to efficiently exploit the parallelism present in applications in order to enjoy the performance benefits that multiprocessors can provide, using a new automatic task parallelization methodology for compilers. The key characteristics we focus on are resource constraints and static scheduling. This methodology includes the techniques required to decompose applicat...



Journal title:
  • Concurrency - Practice and Experience

Volume 10


Publication date: 1998